
Improving performance of batchgenerators #113

Open: wants to merge 62 commits into base: master
Conversation

ancestor-mithril

Thank you for your work; this is a nice tool for augmenting 3D images.
My changes improve the performance of various methods, reducing the CPU time spent doing augmentations.
I've fully vectorized some augmentations and normalizations, and I use in-place NumPy operations where applicable. Please ask if you have a question about any change.
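A minimal sketch of the in-place idea (hypothetical code illustrating the kind of change described, not the actual batchgenerators implementation):

```python
import numpy as np

def zero_mean_unit_variance(data: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    # Hypothetical helper: `data -= mean` mutates the existing buffer,
    # whereas `data = data - mean` would allocate a whole new array of
    # the same size, costing extra CPU time and memory traffic.
    data -= data.mean()
    data /= data.std() + eps
    return data

batch = np.random.randn(2, 3, 16, 16).astype(np.float32)
out = zero_mean_unit_variance(batch)
assert out is batch  # normalized without copying the batch
```

Vectorizing over the whole batch and mutating in place avoids both Python-level loops and intermediate allocations.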

ancestor-mithril and others added 30 commits May 16, 2023 12:01
caching, optimizing conditionals, using tuples instead of lists, doing operations inplace
Using lru_cache for caching tuple creation
*unittest2 also has errors
* also adding minor improvements to utils functions (reformatting file, using lru_cache where possible)
…erands instead of transposing the higher dimensional ones
pandas unique is faster because it uses hashtable
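Two of the commit messages above can be sketched as follows; `as_tuple` is an invented helper for illustration, not the batchgenerators API:

```python
from functools import lru_cache

import numpy as np
import pandas as pd

# 1) lru_cache for tuple creation: when the same small tuple is rebuilt
#    for every sample, caching the constructor skips the reallocation.
@lru_cache(maxsize=None)
def as_tuple(value: float, length: int) -> tuple:
    # Hypothetical helper expanding a scalar into a per-axis tuple.
    return (value,) * length

a = as_tuple(0.5, 3)
b = as_tuple(0.5, 3)
assert a is b  # the cached tuple object is reused, not rebuilt

# 2) pd.unique builds a hash table (O(n)) instead of sorting like
#    np.unique (O(n log n)), so it is usually faster on large label maps.
labels = np.random.randint(0, 5, size=100_000)
assert set(pd.unique(labels)) == set(np.unique(labels))
```

Note that `pd.unique` returns values in order of first appearance rather than sorted, which is fine when only the set of labels matters.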
@FabianIsensee
Member

I am starting to review all your changes. There is a lot of stuff, thanks a lot! Might take me a while to do all that. That must have been so much work, wow!

@ancestor-mithril
Author

You're welcome!
I added a lot of changes, and a big part of them may not be relevant or useful, so you can be selective about what you want to include. I hope you find some parts that can be adapted to batchgenerators.
Also, I've been validating my implementation with the unittests and the nnUNet pipeline, but I'm not sure those cover all the cases.

@FabianIsensee
Member

I am not confident either about how much the unittests cover, which is why I would like to go through everything before approving. You have some pretty cool tricks up your sleeve in how you approach things. That's certainly a lot cleaner than the old batchgenerators implementation.
I will also run some integration tests with nnU-Net to see if there is a degradation (or improvement ;-) ) in segmentation performance.
Have you made any performance measurements of your PR vs the current batchgenerators master, for example with the nnU-Net data augmentation pipeline? That would be interesting.
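Such a measurement could be sketched with `timeit`; `augment_old` and `augment_new` below are hypothetical stand-ins for the master and PR implementations, which are not reproduced here:

```python
import timeit

import numpy as np

def augment_old(data):
    # Allocates a new array for each intermediate result.
    data = data - data.mean()
    return data / (data.std() + 1e-8)

def augment_new(data):
    # Mutates the buffer in place, avoiding intermediate allocations.
    data -= data.mean()
    data /= data.std() + 1e-8
    return data

batch = np.random.randn(4, 3, 64, 64).astype(np.float32)
t_old = timeit.timeit(lambda: augment_old(batch.copy()), number=200)
t_new = timeit.timeit(lambda: augment_new(batch.copy()), number=200)
print(f"master-style: {t_old:.4f}s, PR-style: {t_new:.4f}s")
```

For a meaningful comparison one would run the full nnU-Net augmentation pipeline on identical batches with both branches installed, rather than a toy transform like this.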
